Qubit Channels Can Require More Than Two Inputs to Achieve Capacity 
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We give examples of qubit channels that require three input 
states in order to achieve the Holevo capacity. 



The Holevo capacity C ($) of a channel $ is defined as 
the supremum over all possible ensembles £ — {^j,Pj} 
(consisting of a probability distribution -Kj and set of den- 
sity matrices pj), of the quantity 

x(£) = x({*i,Pi}) = sMp)} -XVj-spfo)] (i) 

j 

where p = J2j n jPj i s the average input, and £(7) = 
— Tr7log7 denotes the von Neumann entropy. Thus, 
C($) = sup £ x(£)- It has been shown @@ that C($) 
is the maximum information carrying capacity of a chan- 
nel restricted to product inputs, but permitting entan- 
gled measurements. C($) > Cshan(^), where the clas- 
sical Shannon capacity Cshan(^) describes the informa- 
tion carrying capacity of a channel when output mea- 
surements (as well as input ensembles) are restricted to 
products. (See, e.g., [|]||@] for precise definitions.) We 
will sometimes use subscripts to denote the supremum 
of (Q) restricted to a particular class of ensembles; in 
particular, we write C„($) to denote the restriction to 
ensembles of n states. It is well-known |ij that for qubit 
channels the maximum can be achieved with an ensemble 
containing at most four states. 

In general, a qubit channel maps the Bloch sphere to 
an ellipsoid |]U. If the channel is unital, i.e, if $(/) = I, 
then the ellipsoid is centered at the origin, and the ca- 
pacity is achieved with a pair of orthogonal inputs whose 
images, which are also states of minimal entropy ||, are 
the endpoints of the major axis of the ellipsoid. For non- 
unital channels the ellipsoid is displaced from the origin, 
and examples are known for which the capacity is 

achieved with two non-orthogonal inputs which are not 
mapped onto states of minimal entropy Here we present 
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examples of non-unital channels which require three input 
states to achieve capacity. Furthermore these channels 
are non-extreme points in the set of all qubit channels, 
and this property is essential to our construction. 

To motivate our strategy, we consider the capacity of 
two well-known channels which have rotational symmetry 
about an axis of the Bloch sphere, and we maximize (|l|) 
over ensembles consisting of a pair of states on a line 
either parallel or orthogonal to this axis. 

First, let $£> denote the shifted depolarizing channel 
which contracts the Bloch sphere to a sphere of radius p 
and shifts it up until it touches the unit sphere, i.e, if p 
is written in the form p — ^ [J + w ■ en , then 

$r>(p) = \ [I + (pwi,pw 2 , (1 - p) + pw 3 ) ■ cr] . (2) 

When p = 0.5, the image of the Bloch sphere satisfies 

x 2 +y 2 + (z-i) 2 = i (3) 

as shown in Fig. [I], and the poles p = | [I ± 03] are 
mapped to the states h \I + C3] and hi which have 
entropy and 1 respectively (using base 2 for loga- 
rithms). If we restrict the input ensemble to a vertical 
line, the x-Quantity ([!]) is maximized by a convex com- 
bination of the poles, so that the average output state 
$rj(p) = \ \l + z<J, i\ lies on the z-axis. One finds that 
this vertical capacity, which we denote Cy, is achieved 
when the output average is at z = 0.6 and its value is 

CV=sup{s(i[/ + za 3 ]) -(1-z)} 
= s(j[l+ 0.6cr 3 ] j - 0.4 w 0.32193. 

(This is also the capacity of a quantum-classical 
channel $qc> discussed below, which maps p 1— > 
|[J + (\ + 1^3)173].) If, instead, we restrict to 
convex combinations of states on a horizontal line, the 
restricted horizontal capacity Ch — 0.2144 is achieved 
when the output average is at the midpoint of the line 
with z = 0.474. Numerical studies confirm that Cy is 
the unrestricted maximum of (Q). 

Next, let $Amp denote the amplitude damping channel 

*Am P (p) = \[I+ {Vpw 1 ,y/pw 2 , (1 - p) + pw 3 ) ■ a]. (4) 

When p = 0.5, the poles are again mapped into the states 
Tj\l + 03] and i/ respectively. However, the image of the 
Bloch sphere is now the ellipsoid 
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shown in Fig. ||. Schumacher and Westmoreland |Tl| l 
studied this channel and observed that its maximum ca- 
pacity of 0.4717 is attained along a horizontal line whose 
image is at height z = 0.596. 

These results suggest that we begin with a shifted de- 
polarizing channel (Q) and continuously deform it into an 
amplitude damping channel (Q) by stretching the sides to 
obtain a channel of the form 



*str(/o) =h[l+ {swi,sw 2 , (1 - m) + M™3) ■ c] 



(G) 



with p < s < JJi. Such a channel is shown in Fig. ^[ As 
s increases from p to ^fp there should be a point at which 
the capacities restricted to vertical and horizontal lines 
become equal. Each of these two-state ensembles, which 
we denote £ y and £ h respectively, defines a unique point 
on the z-axis, namely the height of the average output 
state. If we now take the average of these two ensembles, 
the resulting ^-quantity is 

x [\{£v+£h)] =\{C v + C h ) (7) 
+S[^( PV + P h)\ - \S[*{pv)\ - \S[${p H )]. 

Since the entropy is strictly concave, x[^(£y will 
be strictly greater than \{Cy + Ch) = Cy = Ch un- 
less $tpv) — &(ph)- We have verified that <&str(pv) 7^ 
^Sti(pn) for the crossings associated with a range of val- 
ues of p, including p — 0.5. Since the average ensemble 
\{£y + £h) contains four states, this implies that the 
4-state capacity C4 is strictly greater than the restricted 
two-state capacity Cy = Ch- However, the four input 
states used here all lie in the same plane, in fact on a cir- 
cle of the Bloch sphere. The corresponding circle of den- 
sity matrices lies within a 3-dimensional subspace. By a 
straightforward adaptation of the general arguments in 
jlj , the maximum capacity on a circle is achieved with at 
most three states. Numerical studies of examples of this 
type have confirmed that the 3-state capacity is strictly 
greater than the (unrestricted) 2-state capacity, and that 
C 2 < C4 = C3. Note that even when the 4-state expres- 
sion for x is always strictly less than C3, the restricted 
3-state and 4-state capacities, C3 and C4 respectively, are 
equal since one can always add a fourth state with arbi- 
trarily small probability to the optimal 3-state ensemble. 

To complete the argument above, we need to show that 
the 2-state capacity C 2 actually equals the restricted ca- 
pacity Cy = Ch- Suppose instead that there is an en- 
semble on a skew line for which the x-quantity (jlj) exceeds 
Cy = Ch- Then by symmetry, there will be another 
ensemble (obtained by rotating 180° around the z-axis) 
with the same \- Unless their lines cross along the z- 
axis, and unless the average ensemble occurs at exactly 
this crossing, one will again be able to average the two 
ensembles (as shown in Fig. 0) to obtain a 4-state % value 



which is strictly larger. This argument shows that, unless 
there is a special degeneracy, the 4-state capacity always 
exceeds the 2-state capacity for channels of this type, i.e., 
those with equal vertical and horizontal capacity. 

The stretched channel (^]) was studied numerically for 
p = 0.5. When s = 0.6015, CV($ str ) = C H ($ Str ) = 
0.32193 with average output states at z = 0.6 and z — 
0.58 respectively. Thus, $str(pv) ^Str(pjy), implying 
that C4 > C2 by the argument above. 

Detailed numerical studies were then carried out for 
s = 0.6, yielding C 2 ($ S tr) = C v {$ str ) = 0.32193. The 
maximum capacity C($str) = 0.32499 is achieved with a 
three state ensemble consisting of the input state (0, 0, 1) 
with probability p — 0.4023, and two symmetric states 
(±.93681,0, -.34984) each with p = .29885 (where we 
denote states by vectors in R 3 via the correspondence 
p = i[7 + w • <t]. The optimal inputs and their im- 
ages are shown in Fig. I .) Although the differ ence 
C3(<I>str) - C2( < &str) = 0.003 is small, a three state en- 
semble is definitely needed to achieve the maximum. The 
Shannon capacity was also computed and shown to be 
achieved with the same two-state ensemble that gives Cy. 

Some insight, and a mechanism for constructing ad- 
ditional examples, can be obtained by considering the 
so-called quantum-classical (QC) and classical-quantum 
(CQ) channels introduced by Holevo ||. For qubits, both 
QC and CQ channels contract the Bloch sphere to a line. 
Up to a rotation, a QC channel has the form 

with | *3 ] + \p\ < 1, and its image states are classical in the 
sense that they are diagonal . When | £3 1 = 1 — | p \ , the fine 
is translated parallel to itself until it touches the Bloch 
sphere. In the examples considered above, the vertical 
capacity is equivalent to that of a QC channel that is 
extreme in the sense of reaching the Bloch sphere. 

Similarly, up to a rotation, a CQ channel has the form 

®cq(p) = \ [I + (h,t 2 ,t 3 + pw 3 ) ■ a] 

with |<i| 2 + \t 2 \ 2 + (\t 3 \ + \p\) 2 < 1. When t x or t 2 ^ 0, 
states on the image line do not commute and retain 
quantum features. Indeed, channels of the form 5 [/ + 
y/l — p 2 a 1 + /i 1030-3] were the first ||U for which the 
Holevo capacity was shown to strictly exceed the Shan- 
non capacity. The horizontal capacity we have considered 
is essentially the capacity of a non-extreme CQ channel 
whose image lies on a line of the form 5 ll+vwxo'i +£303] 
with \u\ — 2sy^3~(I — £3) and s is as in @. 

Additional examples of channels which require three 
inputs to maximize capacity can be constructed by de- 
forming shifted depolarizing channels (^) in various ways. 
One can effectively stretch by taking a convex com- 
bination with the amplitude damping channel ^Amp to 
obtain a channel of the form (Iff). This channel can also 
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be written as the convex combination of an amplitude 
damping channel with a suitable QC channel. Further- 
more, the channel (0) can be effectively squeezed down 
by taking a convex combination of <f>£> with a suitable 
CQ channel to obtain a channel of the form 

$Sqz(p) = ||/ + (nw!,qw 2 , (1 - n) + qw 3 ) ■ a]. (8) 

This channel is shown in Fig. ^ for p, — 0.5, q — 0.435. 

The behavior observed above for the stretched channel 
(^) with p = 0.5 is generic over a range of values of p, 
i.e., there is a value of s for which Cy = Ch, and a 
small interval for which C3 > C2. This was confirmed 
by additional numerical studies for p = 0.8. When s — 
0.84, the capacity of C = 0.62088 is achieved with the 
three state ensemble of (0, 0, 1) with p = 0.34415, and the 
two equiprobable states (±0.942895,0,-0.333091). The 
two state capacity C2 = 0.61823 is achieved with the 
orthogonal inputs (0,0, ±1). 

Similarly, for a squeezed channel of the form @, there 
should be a value of q for which Cy — Ch and a small 
interval for which C3 > C%. Numerical studies were 
again conducted with p — 0.5. When q = 0.43535, 
CV($ Sqz ) = CV/($ Sqz ) = 0.21325, suggesting that C 4 > 
C2. This channel was studied in detail for q = 0.435 with 
results similar to those for the stretched channel, but 
smaller effect. The capacity of C = 0.2140 is achieved 
with the three state ensemble (0, 0, 1) with p — 0.3310 
and two symmetric states (±.9534, 0, —.3017) each with 
p = 0.3345. The two state capacity of C 2 = 0.2132 is 
achieved with a non-orthogonal pair of horizontal inputs 
and is therefore greater than the Shannon capacity of 
Cshan = 0.2128 which is again achieved with a pair of or- 
thogonal inputs. The difference C3($sqz) — CM^Sqz) = 
0.0008, although small, is greater than the difference 
C 2 ($sqz) - Cshan(^Sqz) = 0.0004 by a factor of two. 

In general, to optimize (Q) one seeks ensembles whose 
outputs are near the Bloch sphere, where the second term 
is small, but whose average is near the origin where the 
first term achieves its maximum of 1. For unital maps, 
both requirements are compatible and achieved with a 
two-state ensemble whose image lies along a major axis 
of the ellipsoid. Symmetry may permit, but can never 
require, ensembles with more than two inputs. More- 
over, any deformation of a sphere which yields a unique 
major axis, also gives a unique optimal ensemble. For 
non-unital maps, the generic situation is still a two-state 
ensemble, whose image is on a line segment determined 
by the competing terms ([l]). More inputs are required 
when, as in the examples above, two quite differently 
oriented line segments corresponding to QC and/or CQ 
channels emerge with nearly equal two-state capacities. 

We expect to also find qubit channels which require 
four states to maximize the Holevo capacity. However, 
constructing an example by further deformations of the 



channels above will require a carefully balanced asym- 
metric shifting and squeezing. Either stretching or shift- 
ing alone appears to break the balance and return to a 
two-state channel. Moreover, care must be taken to en- 
sure that the deformed ellipsoid is one whose parameters 
satisfy the complete positivity conditions, as discussed in 
. Work in this direction is underway. 

There is another expression for the Holevo capacity, 
discovered independently by Ohya, Petz and Wanatabe 
H and Schumacher and Westmoreland Jl(|[ll|], in terms 
of the relative entropy H(P, Q) = TrP log P - TrP log Q. 

C($) =sup 7 tf[$( 7 ),<i>(p*)] (9) 

where p* = J2j n jPj i s the average state for the optimal 
ensemble. As an immediate corollary, one finds that the 
output states in the optimal ensemble are equi-distant 
from the output average in the sense of relative entropy, 
i.e., for the optimal ensemble H[$(pi), <&(p*)] = C(<&) for 
all pi. Although the relative entropy is not a true metric, 
it can be a useful measure of the "distance" between two 
states. Because determining the states which achieve ca- 
pacity can be numerically delicate, the equidistance prop- 
erty of optimal ensembles provides a useful criterion for 
verifying our numerical work. 

Moreover, one expects the symmetry properties of the 
channel to be reflected in the optimal ensemble. For the 
examples considered above, if one assumes that the op- 
timal ensemble will be a triple in a given plane with the 
North pole as p\, then the ensemble is completely de- 
termined by the condition that p* is on the z-axis and 
H[$( P1 )MP*)] = H[^(p 2 ),^(p*)] = H[$(p 3 ),$(p*)}. 
For channels of the form (||) the condition fi > q im- 
plies that the optimal ensemble will lie in the x-z plane. 
Channels of the form (JsJ) are symmetric with respect to 
rotation around the z-axis, which implies that it suffices 
to restrict attention to the x-z plane; however, any plane 
containing the z-axis would also suffice. In addition, 
when C3 > C 2 the capacity can also be achieved (not 
just approached) with a 4-state ensemble. For example, 
when p = 0.8, the two symmetric states can be replaced 
by the three states (±^0.9534, ±0.9534, -0.3017), 
(0, -0.9534, -0.3017), each with probability 0.223. 

To summarize, it has been conjectured that the Shan- 
non capacity of any qubit channel is achieved when the 
inputs are strings formed from products of two orthog- 
onal pure states. By contrast, as Fuchs Q showed, 
there are channels whose Holevo capacity is achieved 
with two non-orthogonal input states. This means that 
the capacity of such a channel is enhanced when inputs 
are strings formed from products of two non-orthogonal 
states {po, pi}, provided entangled measurements can be 
used on the outputs. We have established that there 
are situations in which entangled measurements can en- 
hance the capacity even further when the products are 
chosen from three non-orthogonal states { P o, pi, p 2 } in 
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a two-dimensional Hilbert space. Moreover, the effect 
arises from a competition between a QC capacity with an 
asymmetric probability distribution, and a CQ capacity 
with a 50/50 distribution. This suggests, in particular, 
that the capacity as a function of all possible inputs has 
a large fiat region near its maximum. It may be possi- 
ble to exploit this flatness and use a variety of alphabet 
distributions with only a small sacrifice in capacity. 
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FIG. 1. The x-z plane of the maximally shifted depolarizing 
channel $d with radius /1 = 0.5. The images of the optimal 
vertical and horizontal ensembles are indicated by • and o 
respectively. 



FIG. 2. The x-z plane of the amplitude damping channel 
$Amp with /i = 0.5. The images of the optimal vertical and 
horizontal ensembles are indicated by • and O respectively. 
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FIG. 3. The x-z plane of the stretched channel, with 
fi = 0.5, showing the optimal inputs and their images. 



FIG. 4. The x-z plane of the squeezed channel with fi = 0.5, 
q = 0.435 showing an ensemble on a skew line and its reflec- 
tion. If x is not optimized at the crossing, averaging to obtain 
a 4-state ensemble will increase x 
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